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Abstract 

Data Centers (DCs) are required to be scalable to large data sets so as to accommodate ever- 
increasing demands of resource-limited embedded and mobile devices. Thanks to the availability of 
recent high data rate millimeter-wave frequency spectrum such as 60GHz and due to the favorable 
attributes of this technology, wireless DC (WDC) exhibits the potentials of being a promising solution 
especially for small to medium scale DCs. This paper investigates the problem of throughput scalability 
of WDCs using the established theory of the asymptotic throughput of wireless multi-hop networks that 
are primarily proposed for homogeneous traffic conditions. The rate-heterogeneous traffic distribution 
of a data center however, requires the asymptotic heterogeneous throughput knowledge of a wireless 
network in order to study the performance and feasibility of WDCs for practical purposes. To answer 
these questions this paper presents a lower bound for the throughput scalability of a multi-hop rate- 
heterogeneous network when traffic generation rates of all nodes are similar, except one node. We 
demonstrate that the throughput scalability of conventional multi-hopping and the spatial reuse of the 
above bi-rate network is inefficient and henceforth develop a speculative 2-pattitioning scheme that 
improves the network throughput scaling potentials. A better lower bound of the throughput is then 
obtained. Finally, we obtain the throughput scaling of an i.i.d. rate-heterogeneous network and obtain 
its lower bound. Again we propose a speculative 2-partitioning scheme to achieve a network with 
higher throughput in terms of improved lower bound. All of the obtained results have been verified 
using simulation experiments. 
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I. Introduction 

Cloud computing has been employed to supply services in a diverse range from Infrastruc¬ 
ture as a Service (laaS) to Software as a Service (SaaS). Hardware, software and networking 
technologies enable prevalence of cloud computing. Distributed storage which is manifested as 
Data Centers (DCs) is a key technology of providing high performance cloud services and is 
needed to be scalable to large data sets. Previous measurements studies highlight an increasing 
interest for cloud-based storage systems, revealing that remote storage space gains popularity 
among interested users by offloading the complexity of hardware management from the user 
devices [1]. 

In order to enhance the performance of DCs plethora of researches have been done in different 
layers of the DC design. Motivated by the prevalence of millimeter-wave spectrum notably 60 
GHz, we have witnessed studies that propose WDC as a promising solution for small to medium 
data centers due to its favorable attributes such as fast connectivity, reduced cable complexity 
and lower cost [2], [3], [4], [5], [6], [7]. 

Nowadays DCs handle hundreds to thousands of storage devices and thus designing a wire-free 
DC solicits the theoretical bounds of performance of such devices. This calls for the problem 
of performance scalability of WDCs. The problem of the asymptotic throughput of a wireless 
multi-hop network for unicast transmissions has been embarked in the seminal work of [8] and 
followed by extensive researches under different assumptions on the network models. Almost, all 
of the previous studies target on homogeneous network nodes, until recently that a few studies 
investigate some forms of heterogeneity. It has been demonstrated that the rate-heterogeneous 
traffic distribution of a DC is a challenging issue that requires special attention in order to 
construct high performance DCs [9], [10], [11]. This motivates our research to investigate the 
asymptotic throughput of a heterogeneous multi-hop network under non-uniform traffic model 
with high similarity to rate-heterogeneous traffic nature of DCs. 

A. Our Contribution 

In this paper, we investigate the throughput of WDC paradigm and its potential scalability 
through the following aspects. 

1) In order to investigate the applicability of fully WDCs for practical DC applications that 
possess heterogeneous traffic behavior, we pose the question of scalability of WDCs with 
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heterogeneous demand rates. We have found that for a network with n nodes, when 
all nodes exeept one node generate similar data rates (77n(l, 1, • • ■, 5'(?^))), the aggregate 
throughput is lower bounded by order of where rjn is a eommon non-eonstant mul- 
tiplieative faetor appearing in the rate of all nodes. 

2) Then we investigate a multi-hop network when traffie demands of all the nodes are 

different. Let r(n) = (ri(n), r 2 (n),..., r„(n))^ = rin{Xi, X 2 , ■ ■ ■, be the random 
eolumn veetor of the arrival rates of n users, in whieh rj^ is the rate sealing faetor and 
Aj’s are sequenee of i.i.d. random variables drawn from a general distribution. A lower 
bound for the throughput of sueh seenario is suggested as E | |, where Ti is the 

throughput of rate-homogeneous network obtained in Gupta and Kumar [8]. 

Previous observations reveal that traffie in DCs is ON/OFF in nature with properties that 
fit the heavy-tailed distribution [4], [10]. Motivating by this observation we show that the 
aggregate throughput of network with heavy-tailed traffie distribution with parameter a 
seales like with high probability. This result indieates that the performanee of 

WDCs under the mentioned traffie pattern does not seale with n and thus is not eompetitive 
with nowadays wired DCs proposals.. 

3) Thus we propose a speeulative 2-partitioning seheme so as to improve the performanee of 

eonventional multi-hopping and then obtain an improved sealing for its throughput. When 
heterogeneous traffie ... ,g{n)), and when ^/n « g{n) « n, the aggregate 

throughput of our proposed seheme improves to n/g{n). 

4) Again and in order to be eompetitive with linear sealability of wired DCs, we employed a 
speeulative 2-partitioning seheme for fully i.i.d. rate-heterogeneous traffie. The lower bound 
of the throughput is improved to 72(«^+2«-4)/{2a2+2a) traffie rates follow heavy-tailed 
distribution with parameter a. 

5) Simulation of sealing law is trieky and most previous studies target only mathematieal 
approaehes [8], [12], [13], [14]. Few studies attempt to simulate the throughput against 
network size [15], [16], [17], [18]. However, exeept [15], [18] they report the results 
only for low size networks and do not mention the diffieulty of simulation of large 
networks. We designed a teehnique to simulate the rate and then using this teehnique 
eonduet extensive simulations for different network sizes and traffie patterns to validate 
the network throughput under the above-mentioned seenarios. 
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B. Structure 

The rest of the paper is organized as follows. Seetion II provides related work. In Seetion 
III, we deseribe the system model. Seetion IV gives the asymptotie theoretieal results of WDCs 
when the number of nodes grows . Section V presents simulation experiments and numerical 
results. Section VI, outlines directions of future research. Finally, Section VII summarizes our 
findings and outlines directions of future research. 

II. Related Work 

This section reviews the progress of research in two directions. 

A. Wireless Data Center (WDC) 

Wireless technologies that have been initially presented in [5] serve as a detour link between 
Top of Racks (TORs) in a DC to mitigate the congestion condition of switches and improve 
maximum transmission delays [4], [19], [20]. Besides using wireless link as a detour, the 
feasibility of completely wireless data centers have been investigated in research community 
so as to mitigate the high wiring costs, performance bottlenecks, and low resiliency to network 
failures of wired data centers. 

[2] attempts to address practical issues in realizing a WDC by proposing a hybrid wired/wireless 
architecture and scheduling wireless links in a distributed manner. The architecture has been 
modeled and an optimization problem has been formulated to schedule the links, and to trade 
off complexity for practicality a heuristic algorithm is presented. 

Recently, a methodology for building wire-free data centers based on 60-GHz radio frequency 
(RF) technology has been presented. Exploring the design space demonstrates the potentials of 
fully WDCs with respect to some major performance measures [7]. 

Multiple-input multiple-output (MIMO) link design scheme for WDC applications has been 
studied in [21]. The impacts of MIMO degrees of freedom have been explored in a multi-node 
packet networking environment. 

An architecture of DC is presented in [6], which incorporates wireless network cards to 
both servers and routers so as to exploit cooperative traffic and eliminate redundant traffic 
among servers. This scheme reduces link loads and increases the network throughput. Through 
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experiment with prototype equipment the authors explore the use of 60 GHz wireless links to 
relieve hot-spots in oversubseribed DCs [4]. 

As a tradeoff between network performanee and eable eomplexity, the authors in [20] proposed 
RF-HYBRID that employed wired-wireless eollaborated hybrid DC arehiteeture to have the best 
of both worlds. 

B. WDCs Scalability 

Gupta and Kumar in [8], initiated the researeh on wireless network throughput sealing, when 
nodes are randomly and independently distributed with equal rates under unieast traffie. They 
showed that eaeh souree-destination pair ean aehieve a bit rate on the order of 1/ y/n logn when n 
tends to infinity, resulting in logn) aggregate throughput. They also showed the ©(a/w) 

sealing for networks with arbitrary plaeement of nodes. In [22], [23], strategies are proposed to 
aehieve the same bound. 

Franeesehetti et al. in [24], removed the gap between the throughput of randomly loeated and 
arbitrarily loeated nodes, and showed the total throughput seales by ©(a/w). 

What has been mentioned so far, is aehieved by the assumption of no eooperation among 
nodes. Xie and Kumar in [12] investigated the strategy of multi-hop wireless networks with 
nodes eooperation. Ozgiir et al. in [13], proposed an order-optimal seheme, with help of the 
distributed MIMO teehnique. 

The above mentioned works, eonsider wireless networks with homogeneous traffie assumption. 
Toumpis et al. in [14], studied the bounds on the throughput of wireless networks with s sourees 
and destinations (0 < d < 1) all with equal rates. Liu et al. in [25], extend this result by 
eonsidering that not every node has data to send and not every node ean be a destination. In 
[26], the authors extend [13] for the ease in whieh a destination node is the sink for k souree 
nodes, while the rest of the s = n — k nodes partieipate in unieast sessions. Liu and Wang 
in [27], investigate the heterogeneous network with multieast and unieast traffie. In eontrast to 
previous works, we eonsider n unieast traffie sessions with different rates. The traffie models 
that we eonsider in this study are a better mateh with the traffie patterns of praetieal DCs and 
measurements studies eorroborate this elaim [4]. 
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Table I: Notations 


Notation Description 

E{K} Expected value of random variable 5R 
/(5R) Probability Density Function (PDF) of random variable 5R 
F{1R) Cumulative Distribution Function (CDF) of random variable 5R 
(.)^ Transpose of a vector 

n Number of nodes 

r Sources demand rate vector 

Ti Demand rate of source i 

T Network Throughput (Network Capacity) 

Xj{t) The signal transmitted by node j at time t 

Yi{t) The signal received by node i at time t 

Wireless channel gain from the node j to the node i 
Pj The power transmitted by the node j 

7 The path loss exponent 


III. System Model 

In this section we describe the WDC Architecture, channel model and assumptions. A summary 
of notations used in the paper are listed in Table I. 

A. WDC Architecture 

As depicted in Fig.l, the wireless nodes in WDC are deployed in a 2-dimensional mesh that 
has n = n 2 X ni nodes with ni nodes at dimension Z, 1 < Z < 2. The position of each node 
is indicated by a distinct 2-digit mixed-radix vector [Z 2 A 1 ], where ii, I < ii < rii, 1 < I < 2. 
A wireless link (Z, j), between nodes i and j, {i,j G {1,2,... ,n}) exists in the WDC if, in 
the absence of any other transmission in the network, receiver j is in the decode range of the 
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Fig. 1: Two dimensional mesh 


transmitter i. Links are assumed to be symmetric. Thus in the 2-dimensional mesh configuration 
two nodes i = [i 2 ,L] and j = [j 2 , Ji] are connected directly, iff there is an /, 1 < / < 2 , such that 
ii = ± 1 and 4 = jfc for 1 < /c < 2, /c 7 ^ /. Node i = [* 2 , 4 ] 4 a comer node if ii G {l,nz} 

for all 1 < / < 2 . 

Node i = [4,4] is a border node if there is /, 1 < / < 2, such that ii G {1, n;}. Thus, in a 2- 
dimensional mesh excluding the corner and border nodes every node is neighbor with two nodes 
in each dimension resulting in a total of 4 neighbors. In this way each node includes processor, 
memory, storage elements and four 60 GHz directional antenna along with the interfacing circuits, 
or one 60 GHz omni-directional antenna. In order to save the cost we may benefit from the space- 
reuse property of wireless communication and use only one omni-directional antenna per node. 
The details of the technique is presented in the following section. The grid size of the mesh 
(i.e. the distance between two neighboring nodes in a dimension) is do and only neighboring 
nodes are in the decode range of each other. Mesh is not a fully connected graph and thus the 
network is a multi-hop WDC; i.e. packets of end-to-end sessions may require passing through 
one or more intermediate nodes. 



























B. Channel Model 


Assume each node is a data source for exactly one other destination, and a data destination 
for exactly one other source, i.e. a unicast scenario. Thus for a network with size n we have 
exactly n traffic sessions indexed by i, 1 < i < n, each with their end-to-end packet flows that 
need to be transported by the network. Session i which is denoted by SDi, has source node Si 
and destination node Di. Thus we have the following set of sessions (Source-destination pairs): 

{SDi,i = l,...,n} (1) 

Source-destination pairs are randomly sampled from the set of all possible source-destination 
pairs using a spatial uniform distribution. rj(n) is the arrival rate (demand rate) of session i. Let 
r(n) = (ri(n), r 2 (n),... ,r„(n))^ be the column vector of the arrival rates of the sessions SDi 
to SDn- Worth to mention that the rate vector in scaling regime depends on the network size and 
due to this fact we present the rate vector as r(? 7 ,). For the simplicity, in what follows we drop the 
dependency of the rate vector r(?7,) to n from rate vector i.e. we use r = (ri, r 2 ,..., r„)^ instead 
of r(n) = (ri(n), r 2 (n),..., r„(n))^. Each session uses X-Y routing to reach its destination, 
which is described in Section IV-A. Network throughput is defined as the sum of all source- 
destination pair rates: 

n 

T = ( 2 ) 

i=l 

Let dij be the Euclidean distance of the nodes i and j (see Eig.2), Xj(t) be the signal transmitted 
by node j at time t, Yi{t) be the signal received by node i, hj^iit) be the wireless channel gain 
from the node j to node i and Zi{t) be the Additive White Gaussian Noise (AWGN) with the 
power Nq, at the time t. 

Multiple sessions may transmit at the same time and thus the signal received at the node i 
can be viewed as the sum of the desired signal, interference, and noise [28]. 

Yi{t) = hj^i{t)Xj{t) -f ^ hk,i{t)Xk{t) -f Zi{t) (3) 

We adopted the physical model and thus, in order to have a successful transmission from the 

node j to the node i, we should have the following Signal to Interference plus Noise Ratio 
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Fig. 2: Grid Network. 


(SINK) satisfaction constraint^ 


QTAff? = __ 




( 4 ) 


where Pj is the power transmitted by the node j at that moment, and /? is a constant threshold. 
Also, we assume a Line of Sight (LoS) model for the wireless channel as follows: 


hj^i — 


exp i) 


d 


' 7/2 

‘hi 


( 5 ) 


in which 9j^i is uniformly distributed on [0,27r), and 7 is the path-loss exponent. We assume 
Pj = P = cte for all j. 


C. Assumptions 

The analysis is based on assumptions that have been widely used in the literature [4], [ 8 ]: 

• The traffic pattern for each session is unicast, i.e. each source only sends to one destination 
node and each receiver is targeted by only one source node. Thus, all sessions are one to 
one and for a network with size n we have n sessions. 

• The network is divided into smaller 3x3 sub-meshes. The time is divided into frames and 
each frame is slotted using 9-TDMA Medium Access Control (MAC) as data link layer 


*For notational simplicity we drop the time index of the variables when clear from the context. 
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protocol in each sub-mesh. This technique enables us to use an omni-direetional antenna 
per node instead of four direetional antennas and henee preserves the eost. 

• The message in eaeh node is divided into packets whieh is being transmitted in the eorre- 
sponding time slot of the node in 9-TDMA frame. 

We eonsider three different rate seenarios as explained in Seetion IV-A to IV-C. 

It is noteworthy to mention that the asymptotie throughput, although are obtained for 2- 
dimensional mesh, is also valid for any deployment of nodes sueh that in a gridded area only 
one node belongs to a grid [29]. Essentially, the results are also valid for the deployment of 
nodes that follows a Poisson point proeess on M" for all dimensions n > 1, i.e., a spatial Poisson 
proeess. Thus, If v{A) is the volume of A C M"^,then the number of points in A follows Poisson 
distribution with mean Xv{A). 

IV. Analysis 

In what follows we derive the asymptotie throughput bound for different network seenarios 
and propose speeulative 2-partitioning that improves the throughput of the network and then 
obtain the new bounds for sueh improved configurations. 

A. Rate-homogeneous throughput 

Let’s denote the uniform demand rate vector as: 

r = ..., 1) (6) 

This means that all souree-destination pairs demand rates are equal to //„. Considering the inter- 
ferenee imposed by eoneurrent transmission of multiple sessions, we aim to find the maximum 
feasible pn so as to attain the maximum network throughput. Clearly, sinee source-destination 
pairing is assumed to be a random event, the network throughput is a random variable. The next 
theorem eharaeterizes the sealing of the average network throughput (It should be noted that this 
theorem reviews a well-known result in the literature originally proved in [8]): 

theorem 1. One can achieve the average throughput E{T} = for the uniform rate 

vector in (6), through multi-hopping. 

Proof: Here we present a simple proof sketeh. For a regirous proof refer to [8]. First we 
explain the elements of the seheme aehieving this throughput briefly: 
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Fig. 3: 9-TDMA Spatial Reuse MAC. 


MAC Protocol: Divide the network into square eells eaeh eontaining 9 nodes. As depieted in 
Fig. 3 the 36 nodes network is divided into 4 square blocks, namely Bi to B^. Inside each Bi, 
label the nodes from 1 to 9. In each time slot only the nodes with the same index as that time 
slot number are allowed to be active. This MAC protocol is called the 9-TDMA spatial reuse 
MAC. For this MAC protocol one can see the following fact: 

lemma 1. If the path loss exponent 7 > 2, then with the 9-TDMA spatial reuse MAC, 0(n) nodes 
can simultaneously transmit their message to their neighbor nodes, while all being successful. 

Proof For proof we refer to [ 8 ]. 

□ 

Routing Protocol: Consider the source-destination pair SDi between source Si and destination 
Di in Fig. 3. We employ X-Y routing and wormhole switching to transfer traffics between each 
source and destination [30]. They have been widely used in contemporary parallel machines due 
to their desirable properties such as low buffering and minimal hardware requirements allowing 
efficient and fast router implementation [30]. In X-Y routing the packet is routed to destination 
one dimension at a time. Within each dimension the packet travels in the preferred direction 
(shortest distance) until it reaches the same coordinate of destination in that dimension. 

In the example in Fig. 3 the packet is firstly routed in dimension X until it reaches the proper 
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Table II: Sequence of packet transmissions of the example in Fig. 3 


Time Slot 

Prev Hop (Block No.) —> Next Hop (Block No.) 

5 

5 (Bi) 6 (Bi) 

15 

6 (Bi) —> 4 (Ba) 

22 

4 (Ba) 5 (Ba) 

32 

5 (Ba) 2 (Ba) 

38 

2 (Ba) 8 (Ba) 

44 

8 (Bs) 5 (Ba) 

50 

5 (Ba) 2 (Ba) 


coordinate in this dimension and then starts routing in the preferred direction in dimension Y 
until reaches its destination Di. We assume that transmissions have the slotted operation and that 
the nodes are synchronized. In each slot one packet can be transferred in the preferred direction 
on each link. Using 9-TDMA the timing diagram of the transmission of the example in Fig. 3 is 
illustrated in Table II. This diagram achieves the highest throughput scaling since it employs the 
reuse throughput of the network. In terms of delay this method is not optimum. For this routing 
and MAC one can prove the following lemma: 

lemma 2. For the X-Y routing described above, average number of hops needed for a packet 
to reach its destination is at least 


Proof See appendix A-A. 

□ 

Using lemma 2 we find that the total required rate of packet transmissions in the network is 
nE{rin}Q{s/n). Using lemma 1 and in order to have a stable network, we should have that the 
total required rate of packet transmissions is equal or less than 0(n). i.e. 

0(n) > X n) X Q{^/n) (7) 


which results in 

E{T} = n X E{?7„} < 0(A/n) (8) 


It means that all E{T} < Q{y/n) are achievable with this multi-hopping scheme. 
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B. Heterogeneous-rate Throughput 

In this section we assume that different source destination pairs require different end-to-end 
transmission rates. 

1) Heterogeneous-rate with one dissimilar rate: First suppose the following rate vector: 


1 , • • •, 


(9) 


r = 


which shows a scenario where all the pairs have equal rate, except the last one, whose rate is 
g{n) times greater than the other nodes. For this example we have the following result: 

theorem 2. With the multi-hopping scheme, and the rate vector indicated in (9), the network 
throughput scales as: 



( 10 ) 


Proof: n—1 nodes need the required rate of order pn and the remaining one node need the 
required rate of order png{n). Thus, we have to wait for the latter node to finish its transmitting 
task. So the aggregate throughput for this scenario is same as the scenario in which all nodes 
have Png{n) required rate. Notice that we assume that when each node finishes it’s packets, 
sends dummy packets to it’s destination. 

■ 

2) Heterogeneous-rate with general i.i.d. distribution: Now we consider a rate vector that 
appears in practical DCs ([4], [10]). Let r = (ri, r 2 ,..., r„)^ = -^ 2 ,..., A„)^ be random 

column vector of the arrival rates of n users. AjS are a sequence of i.i.d. random variables drawn 
from a general distribution. Then we will have the following theorem for the multi-hopping 
scheme introduced in the last section. 

theorem 3. If the throughput for the network with uniform traffic is equal to Ti, then the 
throughput for the network with non-uniform traffic is lower bounded by: 



( 11 ) 


As an example, for the heavy-tailed distribution as given by [31], 


Fx{x) = 1 


1 


,x>l 


( 12 ) 
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the throughput is lower bounded by 


( 13 ) 


Proof: Define the total data volume transmitted from each source to its corresponding 
destination as in the following vector 


(VTi,..., Wn) = Cor7„(Ai,..., A„) 


(14) 


where cq is a positive constant. In other words, the source-destination pair SDi wants to transfer 
Wi bits of data. Again we assume that when each node has finished its packets, it sends dummy 
packets. Then, if we define U as the time needed for this transfer, we will have: 

Wi Cor in CoPnKn 


ti = 


(15) 


Ti/n Ti Ti 

The whole process is finished when the data for all the source-destination pairs is transferred. 
Since the transfer process for different pairs is in parallel, the total time will be: 


CoVnn . 

ttot = max ti = —;;;- mUX \i 


Ti 


and thus the network throughput will be 


To = 




(16) 


(17) 


ttot n maXj A* 

and by considering the expected value of (17) and dummy assumption, we will have (16). For 
the distribution in (12), we will have 


^{Ti} 


n 


l/2-l/a 


= E 


(a) 

(J 

(c) 


ciE 

C 2 E 

C 3 


(maxj Aj/n^/") 

(Ea Aj/n) 
(maxj Ai/n^/“) 

1 

(maXj Ai/n^/“) 


(18) 


which proves (13). In above (a) follows from the fact that throughput of the uniform case is a 
random variable independent of the non-uniform rate vector, and its average scales as ©(-y/n). 
The identity (b) is due to the following lemma: 
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lemma 3. Consider i.i.d. random variables Ai,..., A„ with the c.d.f. indicated in (12). Define 
X = (Yhi^i/n). Then for a > 2 we have: 

E{X} = E{Ai} (19) 

Var{X} ^ 0. 

Proof This is a simple application of the Central Limit Theorem (CLT). 

□ 

For the identity (c) we have used the following lemma. 

lemma 4. 

^ {(maxi Ai/nC“) 

Proof See appendix A-B. 


( 20 ) 

□ 


C. Enhanced throughput of network with heterogeneous rate vector 

The result in the previous section demonstrates that for conventional multi-hopping and the 
spatial reuse, the heterogeneity of traffic demands deteriorates the throughput. Thus it is necessary 
to improve the throughput of wireless DCs so as to be employed as a potential communication 
candidate for practical purposes. We speculatively conjecture that the throughput of hetero¬ 
geneous traffic pattern is upper bounded by homogeneous traffic pattern under conventional 
multi-hopping and the spatial reuse. As we argue in Section IV-B and as bad news we may not 
improve the throughput scaling with methods such as improving routing and MAC protocols. 
As good news, however, we may shift the throughput scaling toward its homogeneous upper 
bound by speculatively 2-partitioning the traffic demands in two parts (part one and part two) 
as delineated below. 

1) Enhanced throughput of network with one dissimilar rate: The first group consists of those 
with lower data rate and the second group consists of the pairs with higher data rate. 

theorem 4. Suppose n — 1 nodes required rate of order rjn cmd the remaining node has the 
required rate of order 7]ng{n). By using our proposed method, the aggregate throughput calculates 



as follow: 
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T = 



gin) « \/n 
y/n « g{n) « n 


( 21 ) 


Proof: At the end of network time, r]n{{n — 1) + g{n)) packets are transmitted by network 
nodes. In the first part of 2-partitioning scheme, n — 1 nodes transmit rjuin — 1) packets with 
throughput y/n. In the next part, one node send rjugin) packet with throughput 1. So we have: 

gn{{n - l)+g{n)) 


T = 




Thus, we have: 


T= lim 


n + gin) 


= 0 


^^ + 9 {n) \g{n) 


n 


n « g{n) << n 


( 22 ) 


(23) 


For g{n) « y/n, (22) tends to y/n when n grows to infinity. 


2) Enhanced throughput of network with general i.i.d. distribution: Let Xfs be i.i.d. random 
variables. We denote A(i), ..., A(„) as the ordered sequence of the Xfs. We define and 

T]nX{n) as the lowest and the highest demand rates, respectively. Thus the rates are ordered as: 

^nA(l) ^ ^ ^nA(n) (24) 

We divide the rates into two sets. 

^low ■{^nA(l), . . . idn^ir)^ (25) 

Cl high ■{^nA(r+l)) • • • )^nA(n)j’ 

The first and the second sets have r and m = n — r members, respectively. Below we discuss 
about index r that maximizes the throughput. Giving the optimal index we obtain two sets that 
we transfer the traffics of the sessions that have members in Ghigh after transferring of the traffics 
of the sessions in Glow ■ It is noteworthy to mention that the traffic of the sessions in each set are 
transfered according to the conventional multi-hopping scheme presented in [8]. The following 
theorem formulates an optimization problem to find index r which maximizes the throughput 
for the heavy-tailed distribution. The theorem is general enough that we may find the maximum 
throughput for any other probability distribution functions of the rate vectors. 
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theorem 5. In the 2-partitioning scheme the network throughput will be lower bounded by 




= n( 


maxE 


Con 


\ rn [cl^yn- + C2^/mX(n) 

and for the distribution in (12), it will result in: 

^ f^{a^+2a-4)/{2a^+2a)\ 


(26) 


(27) 


Co, Cl and C 2 are constant. 


Proof: Define the vector of data volume for the pairs in Glow as follows 


(W'd). 


~ 

(28) 

and for Gugh as 




{W^(r+l), • 


~ ^0 {Vn^{r+1), ■ ■ ■ , Vn^(n) } 

(29) 

Then, for the time needed for each of the pairs to conclude its transmission, for the members 

of Glow, we will have: 

Tr/r 


(30) 

and for Gugh we will have: 




1(1 


z = r + 1,..., n 

(31) 

’ Tm/m' 

Since each phase finishes when all the members of that phase are successful in transferring their 

data, and the transmissions in each phase are concurrent, then, the time needed for concluding 

the first and second phase are: 

h = 

max t(i) 

(32) 


= 

rX(r) 

CoVn rj, 

1 p 



I 2 = 

max tu) 

2 =r+l,...,n 



= 

CoVn rj, 

^ m 
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Thus, for the network throughput we will have: 

Ts = q(^) (33) 

V '^tot J 

\ti + ^2 / 

= n( 

y(rA(^)/T^) + (mA(„)/Tm) J 

When we form the networks in phase one and two, we make the original grid thinner uniformly. 
That is because the pairs with lowest (highest) required data rate are distributed inside the network 
uniformly. Thus, the two sub-networks in two phases are uniformly distributed in space, and 
their aggregate throughput scales with the square root of the number of nodes. Thus, we can put 
Tr = y/r/ci and = y/mjci in (33). This assertion is made precise in the following lemma: 


lemma 5. Consider the original grid consisting of n nodes. Select m of these nodes uniformly 
randomly, and ignore the other nodes (m —)■ cx)). Then the throughput of this new network is of 
order 


Proof See appendix A-C. 

In addition, we can asymptotically put ~ Con which will result in: 

T, = O ' 


= n 


(ciy/rX^r)) + (ciA/mA(„)) 
Con 


□ 


(34) 


{ciy/n - mXi^n-m)) + {ciy/mX(^n)) 

and finally we should choose the design parameter m such that the throughput is maximized 
resulting in 


n = 


( max E < 


CqU 


(35) 


V m \ Ciyjn - mX(n-m) + Ciy/mX(n) 

For the heavy-tailed distribution in (12), from lemma 4, we know that A(„) —)■ as n —)■ oo. 

Also we have the following lemma 


lemma 6. 


A(„_m) — as n —)■ oo 

\m/ 


( 36 ) 











19 


Proof See appendix A-D. 

So, we will have: 


T. = n 


Q 


Con 


Ci\/n — + cw/rnn^/'^ 

Con 


□ 


(37) 


^Cl^/n{n/mY/^ + Ci\/mn^A 

The optimization problem in (35) ean be handled in a simple manner as follows. Sinee the time 
needed for the two phases exhibit a trade-off behavior whieh ean be managed by ehanging m 
(i.e. inereasing m will deerease the time of the first phase, and will inerease the time of the 
seeond phase, simultaneously. ), it can be seen that, the best choice of m happens when we try 
to make the time needed for the two phases equal (in the scaling sense). Thus we will set 

C\\fn{nlvn)^l°' = cwfmvY^'^ 


which will result in 


m = 


by putting (39) into (37) (and after some simple calculations) we will have: 

Ts ~ [^^(«^+2a-4)/(2a2+2a)' 


(38) 


(39) 


(40) 


It should be noted that, in the proof of theorems 4 and 5, we have used the technique of 
introducing dummy information, and hence, the results serve as lower bounds for the network 
throughput. As mentioned above we proved that 2-partitioning of traffic demands improves the 
throughput for network with heterogeneous rate vector. We devised the optimal 2-partitioning 
of the rates to two parts. An interesting line of research that looks NP hard is seeking for a 
potentially k- partitioning of the rates to k parts such that attains true optimal improvement of 
the throughput. 


V. Simulation 

We carried out extensive simulations to evaluate our proposed approaches using OMNET-i-i- 
[32] and MATLAB. Numerous validation experiments have been established. However, for the 
sake of specific illustration, validation results are presented for limited number of scenarios. 
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We adopted 95 percent confidence level to make sure that, on average, the confidence interval 
which is calculated using t-student distribution and standard error contains the true values around 
95 percent of the time. Moreover, Box plots are presented in the figures to illustrate detailed 
behavior of the simulation experiments. The median, the 25th and 75th percentiles are depicted. 
To give additional information about the spread of the results we put vertical lines above and 
below of each box to denote the the 9th percentile and the 91st percentile. Outliers are plotted 
as individual points [31]. 

A. Throughput for network with homogeneous rate vector 

Numerous validation experiments have been performed for several combinations of network 
sizes, and demand rates. For brevity, network sizes of n = 3 x 3 and n = 18 x 18 are presented 
and the nodes are arranged in a mesh with dimension ^/n. In accordance with [8] messages are 
generated at each node according to a rate-homogeneous demand. In agreement with assump¬ 
tions in Section III-C, X-Y routing and 9-TDMA are employed and source-destination pairs 
are determined using a uniform random number generator such that they constitute uniformly 
distributed sessions in the network. 

Each session has 100 packets to transfer. Each simulation experiment was run until all packets 
are delivered to their destination. Eurthermore, we considered a transmission channel without 
packet loss. The simulation experiments were repeated 200 times, each with a new configuration 
of sessions that are uniformly distributed across the network. 

Eig.4a shows the simulation results of throughput against the network size. The Box plot 
denotes the variability of the results in the simulation experiments. The results of aggregate 
throughput of 200 simulation experiments for each network size are obtained as well. These 
results are depicted in Eig.4b for different network sizes using Eog-Eog plot. The fitting line 
slope for this plot is 0.4905 that matches with the slope 0.5 of Eemma 2 with a high degree 
of accuracy. This result reveals that simulation experiments follow the mathematical analysis in 
[8] with high precision confirming the scaling property of the network throughput. Einally, the 
Eog-Eog plot of standard deviation of the aggregate throughput is shown in Eig.4c. This result, 
to the best of our knowledge, has not been reported in earlier studies, revealing the scaling law 
for the variance of the throughput which may be useful for the potential applicability of WDCs 
for streaming and multimedia applications that require stable rates. 
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Fig. 4: Validation of all simulation experiments against the theoretieal results in [8] for different 
network sizes, (b) Log-Log plot of throughput against network size (e) The simulation results 
of the standard deviation rate-homogeneous sealing law of [8]. 


B. Throughput for network with heterogeneous rate vector 

In the next step, we eonsider the previous simulation seenario, with the differenee that n — 1 
nodes send 100 paekets, and just one node send 100 x g{n) paekets, and run for 200 distinet 
souree-destination eonfigurations. 

Table III reports the fitted line slope for aggregate throughput. Aeeording to Theorem 4, 
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Table III: Comparing the theoretieal results of rate veetor 77 ^( 1 , 5 f(n)) for different g{n) 
against simulation results 


9{n) 

n2/3 



Theorem 4 result 




Simulation result 

^ 0.3300 


nO-1656 

Exponent difference 

0.0033 

0.07 

0.0344 


for g{n) = and g{n) = (g{n) >> \/n), aggregate throughput seales like For 

g{n) = scenario, throughput scales like ^/n . As you see in Table III. the simulation results 
confirm Lemma 2 result. 

In the next scenario, all nodes have different demand rate. The demand rate vector r = 
A 2 ,..., An), is according to a heavy-tailed distribution with parameter a. We run each 
simulation until the last packet is received to its destination. The simulation results for a = 2 
and a = 5 for over the 200 simulations, are shown in Fig. 5c and 5d. In the Theorem 3, we 
obtained a lower bound for throughput (fl (nV2-iA)). 

VI. Discussion and Future work 

Using any other MACs other than 9-TDMA cannot improve network throughput scaling as a 
function of the network size. Assuming a hypothetic and ideal MAC that allows every node to 
send its traffic in every time slot, the achieved throughput under this ideal MAC is at most 9 
times higher than 9-TDMA. This improvement of the throughput does not depend on network 
size n, but a constant. Omitting this constant factor results in the same throughput scaling as 
9-TDMA. This is true for routing as well. Even, optimal routing improves the throughput by a 
constant factor, which in turn results in the same throughput scaling as X-Y routing. The above 
statements have been confirmed in the pioneering work of Gupta and Kumar [8]. Thus we require 
a scale-variant solution other than the above scale-invariant improvements to make the throughput 
scalable with network size. This diminishes the WDCs dubiousness as a potential candidate for 
practical purposes. As proved in Section IV-C we noticed that 2-partitiordng improves throughput 
scaling. 
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Fig. 5: Box plot of simulation results of the throughput against network size for the rate sampled 
from heavy-tailed distribution with (a) a = 2.5 (b) a = 5 Log-Log plot of the simulation results 
of the throughput against network size for the rate sampled from heavy-tailed distribution with 
c) a = 2.5 (d) a = 5 


Fig. 6 illustrates the potentials of WDCs eonventional multi-hopping network with spatial reuse 
for different eommunieations sehemes. The figure plots logarithm throughput against logarithm 
network size (Log-Log plot). Fig. 6. (a) shows the linear throughput sealing of Wired DCs. [13] 
proves that wireless eooperative MIMO under rate-homogeneous aehieves this linear throughput 
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log{T) (a) 



- (a) Mathematical bound for cooperative MIMO [13] and Wired DCs 

. (b) Mathematical bound rate-homogenous multihop (r = r?„{i,...,i)) [8] 

—o— 0 —o— (c) Simulation results for r = 77n(Ai,...,A„) 

- (d) Tdie proposed 2-partitioning improved lower bound for r = r/„(Ai,..., An) 

—-(e) Mathematical lower bound for r = r/„(Ai,...,An) 

Fig. 6: The throughput scalability of the wireless network for different communication schemes 
compared to throughput scalability of wired DCs (theoretical and simulation results) 


scaling as well, which makes WDCs comparable to novel wired DCs at least in theory. Fig.6.(b) 
illustrates that the throughput of conventional multi-hop with spatial reuse for homogeneous rate 
r = T]n{l ,..., 1) scales as y/n. This matches with the results obtained in [8]. Fig.6.(e) depicts the 
mathematical obtained for the rate-heterogeneous vector demands (r = r 7 n(Ai,..., An)). It shows 
that the rate-heterogeneity which is a requisite in practical DCs exacerbates the throughput of 
conventional multi-hop with spatial reuse of Fig.6.(b). This makes the potential applicability of 
wireless communication for practical DCs questionable. Fig.6.(c) depicts simulation results for 
r = ■ ■ ■, An) and validates the mathematical analysis. 

Fortunately, as depicted in 6.(d) the 2-partitioning scheme proposed in this paper sheds the 
light that it is possible to move forward to the bounds presented in Fig.6.(b). The graphs for 
fully rate-heterogeneous rate demands (r = T]n{Xi, ■ ■ ■, Xn)) for different parameters a, behaves 
as the results presented in Fig. 6 and thus we do not illustrate them for brevity. Knowing that the 
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wireless eooperative MIMO under homogeneous rate aehieves linear throughput sealing brings 
the hope that this solution may be a potential eandidate for rate-heterogeneous traffie demands 
of praetieal DCs. The problem of determining the throughput of the wireless eooperative MIMO 
and its sealability under rate heterogeneity is an interesting problem that is yet to be investigated. 

When everything seems satisfaetory in terms of throughput sealing, the problem of lateney 
remains still prohibitive. The results presented in Table II reveal that high throughput is aehieved 
in a high amount of time. We ean say that although, inereasing spatial reuse, rises the throughput 
of the network, it grows the number of hops for message delivery and thus translates to high 
lateney. These observations suggest that knowing the insensitivity of throughput sealing to routing 
and MAC protoeols, effieient design of very dense WDCs require eompetent design of every 
layer of protoeol staeks and topology so as to handle both high throughput and low lateney in 
order to eompete with wired DCs eounterparts. 

VII. Conclusion 

We employed the theory of throughput sealing law, whieh is primarily proposed for rate- 
homogeneous demands to investigate the potentials of wireless intereonneetion to deploy WDCs 
as a promising solution espeeially for small to medium seale DCs. We obtained the standard 
deviation of the aggregate throughput for this regime. The results are useful for the potential 
applieability of WDCs for streaming and multimedia applieations that require stable rates. The 
asymptotie throughput of rate-heterogeneous demands is obtained for two different types of 
heterogeneity in order to study the performanee and feasibility of praetieal DCs that possess rate- 
heterogeneous traffie nature. The lower bound for the rate veetor demands 1,..., g{n))) 

is ealeulated as Vt{^/n/g{n)). The E | | lower bound has been obtained for r(?T,) = 

(ri(n),r 2 (n),... ,r„(n)) , where the rates are independently sampled from a heavy-tailed dis¬ 
tribution. 

Both above results indieate that the throughput of WDCs under the mentioned traffie patterns 
does not seale with n and thus are not eompetitive with nowadays wired DCs proposals that 
possess linear sealability. Thus we proposed a speeulative 2-partitioning seheme so as to improve 
the performanee of eonventional multi-hopping and then obtained an improved sealing for its 
throughput. For , g{n)), and when y/n « g{n) « n, the aggregate throughput of 

our proposed seheme improves to n/g{n). For r(?7,) = (ri(n), r 2 (n),... ,rn{n)), where the rates 
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are independently sampled from a heavy-tailed distribution, with parameter a, the lower bound of 
the throughput is improved to fl ^77,(“^+2“-4)/(2a2+2Q;) j Although the results are promising, they 
are not eomparable to nowadays wired DCs. The effeets of MIMO and optimal ^-partitioning 
approaehes are yet to be investigated to answer the potentials of WDCs as an alternative solution 
of praetieal data eenters or even intereonneetion network for general multieomputers. 

Appendix A 
Lemma prooes 

A. Proof lemma 2 

Consider two uniformly randomly ehosen nodes on the grid as in Fig. 7. eonsider a disk around 
one of them with the radius Define rii and n 2 to be the number of nodes inside and 

outside of this disk respeetively. Then the probability that the seeond node is outside of this disk 
is equal to 

^ = 1 -^ = ( 41 ) 

n n n 

= 1 — —)■ 1, as n —>■ cx) 

where Aq and cq are eonstants. Thus the seeond node is, with high probability, outside of this 
disk. This shows that the average distanee of two uniformly randomly ehosen nodes on the grid 
is greater than for any arbitrarily small e. 


B. Proof lemma 4 

In order to prove this lemma we need the following extreme order statistics lemma: 


lemma 7. Consider Ltd. random variables Ai,..., A„ with the c.d.f indicated in (12). Then we 
will have 

Ff{hr,x) ^ G{x) (42) 

where bn = F~^{1 — l/n), and 


G{x) = exp(—X “),x > 0 

and Ff{.) is the c.d.f o/maXjAj. 


( 43 ) 


Proof For the proof refer to [33]. 
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Fig. 7: Proof Sketch. 


Then we will have 


Thus we will have 


Pr{F < x} = Pr I ^max j <x 

= Pr I (^maxAi/6nj < 

= Pr |maxAj < bnX^ 


I i 

= G{x) 


E 


max. 


A*/nV“) 


e4 


/•“ exp(-x-") , 
a / -—-dx 


X' 


0+2 


= cte. 


□ 


(44) 


(45) 


C. Proof lemma 5 

Divide the original network into cells of k nodes. Suppose we select each node to be in the 
final network with the probability m/n, independently. Then we will show that in each cell at 
least one node is maintained in the final network, provided that km/n —>■ oo. The probability of 
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this event is: 



1 - 



m\{n/m)(mk/n) 

n J 


I — ^-kmln 


1 


(46) 


Now, as before we have a grid network of m eells and ean operate it to arrive at the throughput 
■y/m. 


D. Proof lemma 6 

Consider the following lemma: 


lemma 8 (Falk, 1989). Assume that Xi,X 2 ,... are i.i.d. random variables with the c.d.f 
F{x). Define X(i), X( 2 ),..., X(„) to be the order statistics of Xi, X 2 ,, X^. If i ^ 00 and 
i/n ^ 0 as n ^ 00 , then there exist sequences a„ and bn > 0 such that 

X(n—i+l) 


N(0,1), 


(47) 


where ^ denotes convergence in distribution, and N(0,1) is the Normal distribution with zero 
mean and unit variance. Furthermore, one choice for a„ and bn is: 

, Vi 


an = F ^ 

Proof For the proof refer to [33] 


1 ] T bn £( \' 

nj nf{an) 


(48) 


□ 


By applying the above lemma to the distribution speeified in (12), and putting i = m, we will 
have: 


n \ i/“ 

Un I - 1 


m 




(49) 


= m 


^ 0 
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